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A METHOD FOR GENERATING HYPERMUTABLE ORGANISMS 

This invention was made using a U.S. government grant from the 
NIH (CA43460). Therefore, the U.S. government retains certain rights to 
the invention. 

TECHNICAL FIELD OF THE INVENTION 

The invention is related to the area of mismatch repair genes. In 
particular it is related to the field of mutagenesis. 

BACKGROUND OF THE INVENTION 

Within the past four years, the genetic cause of the Hereditary 
Nonpolyposis Colorectal Cancer Syndrome (HNPCC), also known as Lynch 
syndrome U, has been ascert^ned for the majority of kindreds aflFected v^th 
the disease (13). The molecular basis of HNPCC involves genetic instability 
resulting fi-om defective mismatch repair (MMR). To date, six genes have 
been identified in humans that encode for proteins and appear to participate 
in the MMR process, including the mutS homologs GTBP, hMSH2, and 
hMSH3 and the mutL homologs hMLHI, hPMSl, and hPMS2 
(2,7, 11, 17,20,21,22, 24). Germline mutations in four of these genes 
{hMSHl, hMLHl, hPMSl, and hPMS2) have been identified in HNPCC 



kindreds (2, 1 1, 13, 17,24). Though the mutator defect that arises from the 
MMR deficiency can affect any DNA sequence, microsatellite sequences are 
particularly sensitive to MMR abnormalities (14). Microsatellite instability 
is therefore a useful indicator of defective MMR. In addition to its 
occurrence in virtually all tumors arising in HNPCC patients, Microsattelite 
instability is found in a small fraction of sporadic tumors with distinctive 
molecular and phenotypic properties (27). 

HNPCC is inherited in an autosomal dominant fashion, so that the 
normal cells of affected family members contain one mutant allele of the 
relevant MMR gene (mherited from an affected parent) and one wild-type 
allele (inherited from the unaffected parent). During the early stages of 
tumor development, however, the wild-type allele is inactivated through a 
somatic mutation, leaving the cell with no functional MMR gene and 
resulting in a profound defect in MMR activity. Because a somatic mutation 
in addition to a germ-line mutation is required to generate defective MMR 
in the tumor cells, this mechanism is generally referred to as one involving 
"two hits," analogous to the biallelic inactivation of tumor suppressor genes 
that initiate other hereditary cancers (1 1, 13,25). In Ime with tWs two-hit 
mechanism, the non-neoplastic cells of HNPCC patients generally retain 
near normal levels of MMR activity due to the presence of the wild-type 
allele. 

SUMMARY OF THE INVENTION 

It is an object of the present invention to provide a method for 
rendering cells hypermutable. 

It is another object of the present invention to provide genetically 
altered cell lines. 

It is yet another object of the present invention to provide a method 
to produce transgenic animals that are hypermutable. 

It is also an object of the present invention to provide genetically 
altered transgenic animals. 



It is a further object of the invention to provide a method of 
mutating a gene of interest in a cell. 

Yet another object of the invention is to provide a method of 
mutating a gene of interest in an animal. 

These and other objects of the invention are provided by one or 
more of the embodiments described below. In one embodiment of the 
invention, a method for making a hypermutable cell is provided. A 
polynucleotide encoding a dominant negative allele of a mismatch repair 
gene is introduced into a ecU. The cell becomes hypermutable as a result of 
the introduction of the gene. 

In another embodiment of the invention, an isolated hypermutable 
cell is provided. The cell comprises a dominant negative allele of a 
mismatch repair gene. 

In another embodiment of the invention, a hypermutable transgenic 
animal is provided. The animal comprises a dominant negative allele of a 
mismatch repair gene. 

In another embodiment of the invention, a method is provided for 
introducmg a mutation into a gene of interest. A polynucleotide encoding a 
dominant negative allele of a mismatch repair gene is introduced into a cell. 
The cell becomes hypermutable as a result of the introduction of the gene. 
The cell fiirther comprises a gene of interest. The cell is grown. The cell is 
tested to determine whether the gene of interest harbors a mutation. 

In another embodiment of the invention, a method is provided for 
generating a mutation in a gene of interest. A transgenic animal comprising 
a polynucleotide encoding a donunant negative allele of a mismatch repair 
gene is grown. The animal comprises a gene of interest. The animal is 
tested to determine whether the gene of interest harbors a mutation. 

These and other embodiments of the invention provide the art with 
methods that can generate enhanced mutability in cells and animals as well 
as providing cells and animals harboring potentially useful mutations. 
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Figure 1. Diagrams of PMS2 expression vectors (Fig. lA) and pCAR 
reporters (Fig. IB). 

Figure 2. SH cells co-transfected with pCAR reporters and PMS2 
expression vectors after 17 days of drug selection. (Fig. 2 A) Western 
blots of lysates from untransfected SH cells (lane 1) or SH cells 
transfected with PMSl-NOT (lane 2) or PMS2-WT (lane 3). The 
arrow indicates the 1 10 kD protein expected for hPMS2. (Fig. 2B) 
Western blots of lysates from untransfected SH cells (lane 1) or SH 
cells transfected with PMS2'NOT Qme 2) or PMS2-134 (lane 3). The 
arrow mdicates the 14 kD protein expected for hPMS-134. Both A 
and B were probed with an antibody generated against the N- 
terminus of hPMS2. The upper polypeptides in A and the lower 
polypeptides in B represent cross-reactive hamster proteins. (Fig. 
2C)p-galactosidase activity in lysates derived from SH cells 
co-transfected with PM52-M?r (lane 1), PMS2-WT (}zxi^ 2), or 
PMS2'134 Oane 3) plus reporter plasmid. Relative P-galactosidase 
activities are defined as the ratio of P-galactosidase activity in cells 
transfected with pCAR-OF compared to that in cells transfected with 
pCAR-IF; this normalization controlled for transfection efiBdency and 
controlled for Q-galactosidase activity in the cells expressing the 
various PMS2 effector genes. 

Figure 3, In situ p-galactosidase activity of pooled clones of SH cells stably 
transduced with thePMS2-7VOr(Fig. 3 A), PMS2-WT(^\g, 3B), or 
PMS2-134 (Fig. 3C) expression vectors, then re-transfected with pCAR-OF 
reporter. After 17 days of drug selection, the colonies were pooled, 
cultured, and stained for P-galactosidase activity. A pooled culture of of 
PMS2-134 transduced SH cells expressing p-galactosidase from pCAR-OF 



is visible in Fig, 3C. The level of expression is lower, as expected, than in 
SH cells transfected with the pCAR-IF reporter plasmid, shown as a positive 
control in Fig. 3D. Each of the fields illustrated is representative of that 
found in triplicate experiments. 

Figure 4, Protein expression and P-galactosidase activity in stably 
transduced SH clones. (Fig. 4 A) Western blots of lysates fi-om clones 
stably transduced with PMS2-N0T (lanes 1-3) or PMS2-WT (lanes 4-6). 
(Fig, 4B) Western blots of lysates fi-om clones stably transduced with 
PMS2'NOT (lanes 1-3) or PMS2-1 34 (lanes 4-6). (The arrows indicate the 
polypeptide of the appropriate molecular weight. The upper (Fig. 4A) and 
lower (Fig, 4B) molecular weight polypeptides are nonspecific proteins. 
(Fig. 4C) The clones expressing PMS2-N0T Oane 1 A-3 A), PMS2-WT 
(lanes 1B-3B), or PMS2-134 Oanes 1C-3C) were transduced with 
pCAR-OF or pCAR-IF reporter plasmids and multiple subclones selected in 
hygromycin plus geneticin were harvested 17 days later and assayed for 
P-galactosidase activity. Relative P-galactosidase activities are defined as 
the ratio of P-galactosidase activity in cells transduced with pCAR-OF 
compared to that in cells transduced with pCAR-IF. 

Figure 5. Immunoprecipitation of in vitro translated hPMS2 and hMLHl 
protems. (Fig. 5 A) Labelled (indicated by an asterisk) or unlabelled 
proteins were incubated with an antibody to the C-terminus of hPMS2 in 
lanes 1-3 and to hMLHl in lanes 4-6. Lane 7 contains a nonprogrammed 
reticulocyte lysate. The contains codons 135-862 of The 

major translation products o{hPMS2 and hMLHl are indicated. (Fig. 5B) 
Labelled hPMS-134 (containing codons 1-134 of hPMS2) was incubated in 
the presense or absence of unlabelled hMLHl plus an antibody to hMLHl 
(lanes 1 and 2, respectively). Lane 3 contains lysate fi-om a nonprogrammed 
reticulolysate. (Fig. 5C) Labelled proteins were incubated with an antibody 
to the N-terminus of hPMS2. Lane 6 contains a nonprogrammed 



reticulocyte lysate. In both Fig. 5 A and Fig. 5B, autoradiographs of 
inununoprecipitated products are shown. 

Figure 6. Complementation of MMR activity in transduced SH cells. 
Lysates from pooled clones stably transduced with PMS2-NOT, PMSl-WT, 
or PMS2'134 were complemented with i^nxi&QdMutSa or MutLaMMK 
components using the 5*G/T heteroduplex substrate. The values are 
presented as the percentage of repsdr artivity in each case compared to that 
in lysates complemented with both purified MutLa and MutSa components 
to normalize for repair eflBciency in the different lysate backgrounds. The 
values shown represent the average of at least three different determinations. 

T^KTATLED DESCRIPTION OF THE TNVENTION 

The inventors have discovered a method for developing 
hypermutable cells and animals by taking advantage of newly discovered 
alleles of human mismatch repair genes. Dominant negative alleles of such 
genes, when introduced into cells or transgenic animals, increase the rate of 
spontaneous mutations by reducing the effectiveness of DNA repair and 
thereby render the cells or animals hypermutable. Hypermutable cells or 
animals can then be utilized to develop new mutations in a gene of interest. 

The process of mismatch repair, also called mismatch proofreading, 
is carried out by protein complexes in cells ranging from bacteria to 
mammalian cells. A mismatch repair gene is a gene that encodes one of the 
proteins of such a mismatch repair complex. Although not wanting to be 
bound by any particular theory of mechanism of action, a mismatch repair 
complex is believed to detect distortions of the DNA helix resulting from 
non-complementary pairing of nucleotide bases. The non-complementary 
base on the newer DNA strand is excised, and the excised base is replaced 
with the appropriate base which is complementary to the older DNA strand. 
In this way, cells eliminate many mutations which occur as a result of 
mistakes in DNA replication. 



Dominant negative alleles cause a mismatch repair defective 
phenotype even in the presence of a wild-type allele in the same cell. An 
example of a dominant negative allele of a mismatch repair gene is the 
human gene hPMS2-134, which carries a truncation mutation at codon 134. 
The mutation causes the product of this gene to abnormally terminate at the 
position of the 134th amino acid, resulting in a shortened polypeptide 
containing the N-terminal 133 amino acids. Such a mutation causes an 
increase in the rate of mutations which accumulate in cells after DNA 
replication. Expression of a dominant negative allele of a mismatch repair 
gene results in impairment of nusmatch repair activity, even in the presence 
of the wild-type allele. Any allele which produces such eflfect can be used in 
this invention. 

Dominant negative alleles of a mismatch repair gene can be obtained 
from the cells of humans, animals, yeast, bacteria, or other organisms. Such 
alleles can be identified by screening cells for defective mismatch repair 
activity. The cells may be mutagenized or not. Cells from animals or 
humans with cancer can be screened for defective mismatch repair. Cells 
from colon cancer patients may be particularly usefiil. Genomic DNA, 
cDNA, or mRNA from any cell encoding a mismatch repair protein can be 
analyzed for variations from the wild type sequence. Dominant negative 
alleles of a mismatch repair gene can also be created artificially, for example, 
by producing variants of the hPMS2-134 allele or other mismatch repair 
genes. Various techniques of site-directed mutagenesis can be used. The 
suitability of such alleles, whether natural or artificial, for use in generating 
hypermutable cells or animals can be evaluated by testing the mismatch 
repair activity caused by the allele in the presence of one or more wild-type 
alleles, to determine if it is a dominant negative allele. 

A cell or an animal into which a dominant negative allele of a 
mismatch repair gene has been introduced will become hypermutable. This 
means that the spontaneous mutation rate of such cells or animals is elevated 
compared to cells or animals without such alleles. The degree of elevation 



of the spontaneous mutation rate can be at least 2-fold, 5-fold, 10-fold, 20- 
fold, 50-fold, 100-fold, 200-fold, 500-fold, or 1000-fold that of the normal 
cell or animal. 

According to one aspect of the invention, a polynucleotide encoding 
a dominant negative form of a mismatch repair protein is introduced into a 
cell or a transgenic animal. The gene can be any dominant negative allele 
encoding a protein which is part of a mismatch repair complex, for example, 
PMS2, PMSl MLHl, or MSH2. The dominant negative aUele can be 
naturally occurring or made in the laboratory. The polynucleotide can be in 
the form of genomic DNA, cDNA, RNA, or a chemically synthesized 
polynucleotide. The polynucleotide can be introduced into the cell by 
transfection, 

Transfection is any process whereby a polynucleotide is introduced 
into a cell. The process of transfection can be carried out in a living animal, 
e.g.y using a vector for gene therapy, or it can be carried out in vitro, e.g., 
using a suspension of one or more isolated cells in culture. The cell can be 
any type of eukaryotic cell, including, for example, cells isolated from 
humans or other primates, mammals or other vertebrates, invertebrates, and 
single celled organisms such as protozoa or yeast. 

In general, transfection will be carried out using a suspension of 
cells, or a single cell, but other methods can also be applied as long as a 
suflScient fraction of the treated cells or tissue incorporates the 
polynucleotide so as to allow transfected cells to be grown and utilized. The 
protein product of the polynucleotide may be transiently or stably expressed 
in the cell. Techniques for transfection are well known. Available 
techniques for introducing polynucleotides include but are not limited to 
electroporation, transduction, cell fiision, the use of calcium chloride, and 
packaging of the polynucleotide together with lipid for fusion with the cells 
of interest. Once a cell has been transfected with the mismatch repair gene, 
the cell can be grown and reproduced in culture. If the transfection is stable, 



such that the gene is expressed at a consistent level for many cell 
generations, then a cell line results. 

An isolated cell is a cell obtained from a tissue of humans or animals 
by mechanically separating out individual cells and transferring them to a 
suitable cell culture medium, either with or without pretreatment of the 
tissue with enzymes, e.g., coUagenase or trypsin. Such isolated cells are 
typically cultured in the absence of other types of cells. Cells selected for 
the introduction of a dominant negative allele of a mismatch repair gene may 
be derived from a eukaryotic organism in the form of a primary cell culture 
or an immortalized cell line, or may be derived from suspensions of single- 
celled organisms. 

A polynucleotide encoding a dominant negative form of a mismatch 
repair protein can be introduced into the genome of an animal by producing 
a transgenic animal. The animal can be any species for which suitable 
techmques are available to produce transgenic animals. For example, 
transgenic animals can be prepared from domestic livestock, e,g,, cows, 
pigs, sheep, goats, horses, etc.; from animals used for the production of 
recombinant proteins, e.g., cows, pigs, or goats that express a recombinant 
protein in their milk; or experimental animals for research or product testing, 
e.g., mice, rats, hamsters, guinea pigs, rabbits, etc. 

Any method for making transgenic animals known in the art can be 
used. According to one process of producing a transgenic animal, the 
polynucleotide is injected into a fertilized egg of the animal and the injected 
egg is placed into a pseudo-pregnant female. The egg develops into a 
mature animal in which the polynucleotide is incorporated and expressed. 
The fertilized egg is produced in vitro from the egg and sperm of donor 
animals of the same species as the pseudo-pregnant female, who is prepared 
by hormone treatments to receive the fertilized egg and become pregnant. 
An alternative method for producing transgenic animals involves introducing 
the polynucleotide into embryonic cells by injection or transfection and 
reintroducing the embryonic cells into the developing embryo. With this 
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method, however, if the polynucleotide is not incorporated into germline 
cells, the gene will not be passed on to the progeny. Therefore, a transgenic 
animal produced by this method must be evaluated to determine whether the 
gene is incorporated into germ cells of the animal. Once transgenic animals 
are produced, they can be grown to reproductive age, when they can be 
mated to produce and maintain a colony of transgenic animals. 

Once a transfected cell line or a colony of transgenic animals has 
been produced, it can be used to generate new mutations in one or more 
gene(s) of interest. A gene of interest can be any gene naturally possessed 
by the cell line or transgenic animal or introduced into the cell line or 
transgenic animal. An advantage of using such cells or animals to induce 
mutations is that the cell or animal need not be exposed to mutagenic 
chemicals or radiation, which may have secondary harmfiil effects, both on 
the object of the exposure and on the workers. 

Mutations can be detected by analymig for alterations in the 
genotype of the cells or animals, for example by examining the sequence of 
genomic DNA, cDNA, messenger RNA, or amino acids associated with the 
gene of interest. Mutations can also be detected by screening the phenotype 
of the gene. A mutant phenotype can be detected by identifying alterations 
in electrophoretic mobility, spectroscopic properties, or other physical or 
structural characteristics of a protein encoded by a mutant gene. One can 
also screen for altered function of the protein in situ, in isolated form, or in 
model systems. One can screen for alteration of any property of the cell or 
animal associated with the fimction of the gene of interest. 

The above disclosure generally describes the present invention. A 
more complete understanding can be obtained by reference to the following 
specific examples which are provided herein for purposes of illustration 
only, and are not intended to limit the scope of the invention. 
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FXAMPLE 1 ! hPMS2'134 Encod eis ft Dominant Negative Mismatch 
Repair Protein. 

A profound defect in MMR was found in the normal cells of two 
HNPCC patients. That this defect was operative in vivo was demonstrated 
by the widespread presence of microsattelite instability in non-neoplastic 
cells of such patients. One of the two patients had a germ-line truncating 
mutation of the hPMS2 gene at codon 134 (the hPMS2-134 mutation), 
while the other patient had a small germ-line deletion within the hMLHl 
gene (26). These data thus contradicted the two-liit model generally 
believed to explain the biochemical and biological features of HNPCC 
patients. The basis for this MMR deficiency in the normal cells of these 
patients was unclear, and several potential explanations were offered. For 
example, it was possible that the second allele of the relevant MMR gene 
was mactivated in the germ-line of these patients through an undiscovered 
mechanism, or that unknovm mutations of other genes involved in the MMR 
process were present that cooperated with the known germ-line mutation. 
It is clear firom knock-out experiments in mice that MMR-deficiency is 
compatible with normal growth and development, supporting these 
possibilities (1,3,6). Alternatively, it was possible that the mutant alleles 
exerted a dominant negative effect, resulting in MMR deficiency even in the 
presence of the wild-type allele of the corresponding MMR gene and all 
other genes involved in the MMR process. To distinguish between these 
possibilities, we expressed the tmncated polypeptide encoded by the 
hPMS2'I34 muUtion in an MMR proficient cell Une and analyzed its affect 
on the cell's MMR activity. The results showed that this mutant could 
indeed exert a dominant negative effect, resulting in biochemical and genetic 
manifestations of MMR deficiency. 

The MMR proficient Syrian hamster TK tslS cell line (hereafter 
called SH cells) was cotransfected with various hPMS2 expression plasmids 
plus reporter constructs for assessing MMR activity. The hPMS2 
expression plasmids contained the normal hPMS2 gene product or the 
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truncated hPMS2 gene identified in the patient described above (PMS2-WT 
andPMS2-134, respectively, Fig. lA). An "empty" vector devoid of hPMS2 
sequences (PMS2-NOT, Fig. 1 A) served as an additional control. The 
reporter construct pCAR-OF (cut of frame) contained a hygromycin 
resistance gene plus a P-galactosidase gene containing a 29 bp out-of-fi-ame 
poly-CA tract at the 5* end of its coding region. The reporter construct 
pCAR-IF (in frame) was identical except that the poly-CA tract was 27 bp 
and therefore did not disrupt the P-galactosidase reading frame (Fig. IB), 
The pCAR-OF reporter would not generate P-galactosidase activity unless a 
frame-restoring mutation (/.e., insertion or deletion) arose following 
transfection. 

Three diflferent transfection schemes were used to evaluate the 
effects of the PMS2'134 mutation on SH cells. In the first scheme, 
the expression vectors plus the reporters were co-transfected together. 
Pools containing greater than 100 clones were generated following 
selection with hygromycin for 17 days and harvested for Western blot 
and P-galactosidase assays. SH cells transduced with PMS2-WT and 
PMS2-I34 synthesized polypeptides of the expected size, as assessed 
with anti-hPMS2 antibodies on Western blots (Fig. 2A and 2B). As 
expected, virtually no P-galactosidase activity was observed in SH 
ceUs transfected with the pCAR-OF reporter plus PMS2'N0T (Fig, 
2C). However, SH cells transfected with PMS2-J34 expressed 
considerable p-galactosidase activity, significantly more than those 
transfected with PMS2-WT (Fig. 2C). These results suggested that 
the truncated polypeptide encoded by the PMS2-134 construct 
perturbs the endogenous MMR machinery, resulting in deletions or 
insertions that restored the reading frame. The exact nature of these 
presumed deletions or insertions could not be assessed, as multiple 
copies of the reporter constructs were transduced under our 
conditions, and the wild type p-galactosidase sequence was in great 
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excess over the expected mutants, precluding their demonstration by 
direct sequencing. 

In the second scheme, SH cells were co-transfected with each 
of the PMS2 expression vectors plus the hygromycin-resistance 
plasmid pLHL4. Hygromycin resistant cultures containing greater 
than 100 clones were pooled and expanded. These cultures were then 
co-transfected with pCAR-IF or pCAR-OF reporters plus a separate 
plasmid allowing geneticin selection. Two weeks later, the pooled 
cells, each containing more than lOG colonies resistant to both 
hygromycin and geneticin, were stained with X-gal to assess 
P-galactosidase activity. As shown in Figure 3, the cultures 
transfected with PMS2-134 (panel C) contained many blue cells, while 
virtually no cells were blue in the cultures transfected with 
PMS2'NOT or PMS2-WT (panels A and B, respectively). In each 
case, transfection eflBciency was controlled by parallel transfections 
using pCAR-IF which also served as a control for p-galactosidase 
activity of cells expressing the various PMS2 effector genes, which 
resulted in similar p-galactosidase expression levels in all cases 
(example in Fig. 3D). Increases in p-galactosidase activity after 
PMS2-1S4 transfection compared to /MS2-ffT transfection were also 
observed when a similar experimental protocol was applied to the 
MMR-proficient human embryonic kidney cell line 293. These cells 
were cotransfected with the pCAR-OF plus the various PMS2 effector 
plasmids and selected for 17 days in hygromycin. At day 17, colonies 
were stained with X-gal to assess P-galactosidase activity and scored 
for P-galactosidase expressing cells. As shown in Table 1, only those 
cells expressing the PMS2-134 polypeptide expressed a detectable 
p-galactosidase activity. These data demonstrate a similar dominant 
negative effect of the hPMS2-134 protein in both rodent and human 
systems and validate the utility of the rodent system in these studies. 
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In the third scheme, SH cells were transfected with each of the 
PMS2 expression vectors as described for the second scheme, but 
individual clones , rather than pooled clones, were expanded following 
drug selection. Of twenty clones transfected with five 
were shown to express readily detectable levels of fiill-length PMS2 
proteins (examples in Fig. 4A, lanes 4-6). Similar analyses of twenty 
PMS2-134 clones revealed four clones which expressed truncated 
PMS2 polypeptides of the expected size (examples in Figure 4B, lanes 
4-6 ). Three clones expressing fiill-length or truncated PMS2 
proteins, as well as three randomly selected clones &om PMS2-NOT 
transfected cells (Figure 4A and 4B, lanes 1-3) were chosen for 
further analysis. The individual clones were tested for P-galactosidase 
activity following co-transfection with pCAR-OF plus the pNTK 
plasmid, as described above for the pooled clones. As shown in 
Figure 4C, each of the three clones (lanes 3 A-3C) expressing the 
truncated hPMS2 polypeptide yielded much higher P-galactosidase 
activities following transfection with pCAR-OF than did the clones 
expressing the fiill-length hPMS2 protein (lanes 2A-2C) or no hPMS2 
protein (lanes 1 A-IC). 
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Table 1. P-galactosidase expression of 293 clones transfected with 
pCAR-OF reporter construct plus PMS2 effector plasmids. 293 cells 
were cotransfected with the pCAR-OF P-galactosidase reporter 
plasmid plus the PMS2-NOT, 'WT, or ~J34 effector plasmids. 
Transfected cells were selected in hygromycin for 17 days and stained 
with x-gal for P-galactosidase activity (blue colored cells). The 
results below represent the mean +/- standard deviation of triplicate 
experiments. 

Sample Blue colonies White wlpnigg 

PMSl'NOT 0 +/- 0 17 +/- 2.7 

PMS2-WT 0 +/- 0 18 +/- 4.0 

PMS2-I34 15+/- 2,1 6 +/-2.1 
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Plasmids. The full-length wild-type hPMS2 cDNA was 
obtained from a human Hela cDNA library as described (18). An 
hPMS2 cDNA containing a termination codon at amino acid 134 was 
obtmned via RT-PCR from the patient in which the mutation was 
discovered (9). The cDNA fragments were cloned into the BamHI 
site into the pSG5 vector, which contains an SV40 promoter followed 
by an SV40 polyadenylation signal (8). The pCAR reporter vectors 
described in Fig. 1 were constructed as described in ref. 21 and 25, 

Cell lines and transfeetion. Syrian Hamster fibroblast Tk*tsl3 
cells were obtmned from ATCC and cultured as described (15). 
Stably transfected cell lines expressing hPMS2 were created by 
cotransfection of the PMS2 expression vectors and the pLHL4 
plasmid encoding the hygromycin resistance gene at a ratio of 3 :1 
(pCAR:pLHL4) and selected with hygromycm. Stably transfected ceU 
lines containing pCAR reporters were generated by co-transfection of 
pCAR vectors together with either pNTK plasmid encoding the 
neomycin resistance plasmid or with pLHL4. All transfections were 
performed using calcium phosphate as previously described (15). 

P-galactosidase assay. Seventeen days following transfeetion 
with pCAR, P-galactosidase assays were performed using 20 ng of 
protein in 45 mM 2-mercaptoethanol, ImM MgCl^, 0. 1 M NaP04 and 
0.6 mg/ml Chlorophenol red-p-D-galatopyranoside (CPRG, 
Boehringer Mannheim). Reactions were incubated for 1 hour, 
terminated by the addition of 0,5 M Na^COa, and analyzed by 
spectrophotometry at 576 nm (16). For in situ P-galactosidase 
staining, cells were fixed in 1% glutaraldehyde in PBS and incubated 
in 0.15 MNaCl, 1 mMMgCl^, 3 .3 mMK4Fe(CN)6, 3 .3 mM 
K3Fe(CN)6, 0.2% X-Gal for 2 hours at 37X. 
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EX AMPLE 2: hPMS2-13 4 Causes a Defect in MMR Activity 

The most likely explanation for the diflFerences in 
P-galactosidase activity between PMS2-WT and PMS2-134 
transfected cells was that the PMS2-134 protein disturbed MMR 
activity, resulting in a higher frequency of mutation within the 
pCAR-OF reporter and re-establishing the ORF. To du-ectly test the 
hypothesis that MMR was altered, we employed a biochemical assay 
for MMR with the individual clones described in Fig. 4. Nuclear 
extracts were prepared from the clones and incubated with 
heteroduplex substrates containing either a /CA\ insertion-deletion or 
a G/T mismatch under conditions described previously. The /CA\ and 
G/T heteroduplexes were used to test repair from the 3* and 5' 
directions, respectively. There was a dramatic diflFerence between the 
PMS2-134 expressing clones and the other clones in these assays 
(Table 2A). While all clones repaired substrates from the 3' direction 
(/CA\ heteroduplex), cells expressing the PMS2-134 polypeptide had 
very little 5* repair activity. A similar directional defect m mismatch 
repair was evident with pooled clones resulting &omPMS2'J34 
transfection, or when the heteroduplex contained a 2-4 base pair loop, 
examples of which are shown in Table 2B. A small decrease in MMR 
activity was observed in the 3' /CA\ PMS2-WT repair assays, perhaps 
a result of interference in the biochemical assays by overexpression of 
the PMS2 protein. No significant activity was caused by PMS2-WT in 
the in situ p-galactosidase assays (Fig. 3; Table 1), a result more likely 
to reflect the in vivo condition. 
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Table 2. Mismatch repair activity of nuclear extracts from SH clones 
(A) or pooled cultures (B). The extracts were tested for MMR 
acti^dty with 24 finol of heteroduplex. *These data represent similar 
results derived from greater than five independent experiments. 



A SH clones* 



PMS2-NOT 
clone A 
clone B 
clone C 

PMS2-WT 
clone A 
clone B 
clone C 

PMS2-134 
clone A 
clone B 
clone C 



Repaired substrate ffinol/15 min;^ 
3* /CA\ 5' G/T 



10.2 
12.7 
13.5 



2.8 
5.7 
4.7 



2.5 
2.4 
5.0 



3.5 
2.9 
5.5 



2.2 
4.8 
2.9 



0.0 
0.0 
0.5 



30 



35 



B. Pooled cultures 



40 



Cell Line 
PMS2-NOT 
PMS2-WT 
PMS2-134 



Repaired s ubstrate rfinol/lS min^ 
3'G/T 5'G/T 3'/CTG\ 5'/CTG\ 

2.07+/- 0.09 2.37+/- 0.37 3.45 +/- 1.35 2.77+/-I.37 
1.65+/- 0.94 1.86+/- 0.57 1.13+/- 0.23 1.23+/- 0.65 
0.14+/- 0.2 0.0+/- 0.0 1.31 +/-0.66 0.0 +/- 0.0 
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Westem blots. Equal number of cells were lysed directly in lysis 
buflfer (60 mM Tris, pH 6.8, 2% SDS, 10% glycerol, 0.1 M 
2-mercaptoethanol, 0.001% bromophenol blue) and boiled for 5 minutes. 
Lysate proteins were separated by electrophoresis on 4-12% Tris-glycine 
gels (for analysis of full-length hPMS2) or 4-20% Tris-glycine gels (for 
analysis of hPMS2-134). Gels were electroblotted onto Immobilon-P 
(Millipore) in 48 mM Tris base, 40 mM glycine, 0.0375% SDS, 20% 
methanol and blocked overnight at 4°C in Tris-buflfered saline plus 0.05% 
Tween-20 and 5% condensed milk. Filters were probed with a polyclonal 
antibody generated against residues 2 - 20 of hPMS2 (Santa Cruz 
Biotechnology, Inc.) and a horseradish peroxidase conjugated goat 
anti-rabbit secondary antibody, using chemilluminescence for detection 
(Pierce). 

In vitro translation. Linear DNA fragments containing hPMS2 and 
hMLHl cDNA sequences were prepared by PCR, incorporating sequences 
for in vitro transcription and translation in the sense primer. A full-length 
hMLHl fragment was prepared using the sense primer 
5-ggatcctaatacgactcactatagggagaccaccatgtcgttcgtggcaggg-3* (codons 1-6) 
and the antisense primer 5 -taagtcttaagtgctaccaac-3' (located in the 3' 
untranslated region, nt 241 1-2433), using a wild-type hMLHl cDNA clone 
as template. A fiill-length hPMS2 fragment was prepared with the sense 
primer 

5'-ggatcctaatacgactcactatagggagaccaccatggaacaattgcctgcgg-3' (codons 1-6) 
and the antisense primer 5 -aggttagtgaagactctgtc-3* (located in 3' 
untranslated region, nt 2670-2690) using a cloned hPMS2 cDNA as 
template. A fragment encoding the amino-terminal 134 amino acids of 
hPMS2 was prepared using the same sense primer and the antisense primer 
5'-agtcgagttccaaccttcg-3. A fragment containing codons 135 - 862 of 
hPMS135 was generated using the sense primer 
5 '-ggatcctaatacgactcactatagggagaccaccatgatgtttgatcacaatgg-3 • (codons 
135-141) and the same antisense primer as that used for the full-length 
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hPMS2 protein. These fragments were used to produce proteins via the 
coupled transcription-translation system (Promega). The reactions were 
supplemented with ^^S-labelled methionine or unlabelled methionine, as 
indicated in the text. The PMS135 and hMLHl proteins could not be 
simultaneously radiolabelled and immunoprecipitated because of their similar 
molecular weights precluded resolution. Lower molecular weight bands are 
presumed to be degradation products and/or polypeptides translated from 
alternative internal methionines. 

Immunoprecipitation. Immunoprecipitations were performed on in 
vitro translated proteins by mixing the translation reactions with 1 |ag of the 
MLHl specific monoclonal antibody (mAB) MLH14 (Oncogene Science, 
Inc.), a polyclonal antibody generated to codons 2-20 of hPMS2 described 
above, or a polyclonal antibody generated to codons 843-862 of hPMS2 
(Santa Cruz Biotechnology, Inc.) in 400 ^l of EEC buffer (50 mM Tris, pH 
7.5, 0. 1 M NaCl, 0,5% 1SIP40). After incubation for 1 hr at 4**C, protein A 
sepharose (Sigma) was added to a final concentration of 10% and reactions 
were incubated at A^'C for 1 hour. Proteins bound to protein A were washed 
five times in EEC and separated by electrophoresis on 4-20% Tris-glycme 
gels, which were then dried and autoradiographed. 

Eiochemical assays for mismatch rep^r. MMR activity in nuclear 
extracts was performed as described, using 24 finol of substrate (12,25). 
Complementation assays were done by adding - 100 ng of purified MutLa 
or MutSa components to 100 ^ig of nuclear extract, adjusting the final KCl 
concentration to 100 mM (4,10,30). The substrates used in these 
experiments contain a strand break 181 nucleotides 5' or 125 nucleotides 3' 
to the mismatch. Values represent experiments performed at least in 
duplicate. 
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FX AMPLE 3: Carboxv T erminus of hPMvS2 Mediates Tnteractipn 
between hPMS2 and hMLHl 

To elucidate the mechanism by which hPMS2-134 aflFected MMR, 
we analyzed the interaction between hPMS2 and hMLHl. Previous studies 
have shown that these two proteins dimerize to form a fiinctionally active 
complex (12, 28). Proteins were synthesized in vitro using reticulocyte 
lysates, employing RNA generated from cloned templates. The full-length 
hMLHl and hPMS2 proteins bound to each other and were co-precipitated 
with antibodies to either protein, as expected (data not shown). To 
determine the domain of hPMS2 which bound to hMLHl, the amino 
terminus (codons 1-134), containing the most highly conserved domam 
among mutL proteins (19,24), and the carboxyl temunus (codons 135 - 862) 
were separately cloned and proteins produced in vitro in coupled 
transcription-translation reactions. When a ^^S-labelled, fiill length hMLHl 
protein (Fig. 5 A, lane 5) was mbced vnth the unlabelled carboxyl terminal 
hPMS2 polypeptide, a monoclonal antibody (mAb) to the carboxyl terminus 
of hPMS2 efficiently immunoprecipitated the labeled hMLHl protein (lane 
1). No hMLHl protein was precipitated in the absence of hPMS2 (lane 2). 
Conversely, when the ^^S-labelled carboxyl-terminus of hPMS2 (lane 3) was 
incubated with unlabelled, fiill-length hMLHl protein, an anti-hMLHl mAb 
precipitated the hPMS2 polypeptide (lane 4). In the absence of the 
unlabelled hMLHl protein, no hPMS2 protein was precipitated by this mAb 
(lane 6). The same antibody failed to immunoprecipitate the amino-terminus 
of hPMS2 (amino acids 1-134) when mbced with unlabelled MLHl protein 
(Fig. 5B, lane 1). This finding was corroborated by the converse experiment 
in which radiolabelled hPMS2-134 (Fig. 5C, lane 1) was unable to 
coprecipitate radiolabelled MLHl when precipitations were done using an 
N-terminal hPMS2 antibody (Fig. 5C, lane 2) while this antibody was shown 
to be able to coprecipitate MLHl when mixed with wild-type hPMS2 (Fig. 
5C, lane 4). 
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The initial steps of MMR are dependent on two protein complexes, 
called MutSa and MutLa (14). As the amino terminus of hPMS2 did not 
mediate binding of hPMS2 to HMLHl, it was of interest to determine 
whether it might instead mediate the interaction between the MutLa 
complex (composed of hMLHl and hPMS2, ref 12) and the MutSa 
complex (composed of MSH2 and GTBP, ref 4). Because previous studies 
have demonstrated that MSH2 and the MutLa components do not associate 
in solution (28), we were unable to assay for direct hPMS2-134:MutSa 
interaction. We therefore used a different approach to address this issue, and 
attempted to complement nuclear extracts from the various SH cell lines 
with MutSa or MutLa. If the truncated protein present in the PMS2-134 
expressmg SH cells was binding to MutSa and lowering its effective 
concentration in the extract, then adding intact MutSa should rescue the 
MMR defect in such extracts. Purified MutSa added to such extracts had no 
effect (Fig. 6). In contrast, addition of intact MutLa to the extract 
completely restored directional repair to the extracts from PMS2-134 cells 
(Fig. 6). 

The results described above lead to several conclusions. First, 
expression of the amino-terminus of hPMS2 results in an increase in 
microsattelite instability, consistent with a replication error (RER) 
phenotype. That this elevated microsattelite instability is due to MMR 
deficiency was proven by evaluation of extracts from stably transduced cells. 
Interestingly, the expression of PMS2-134 resulted in a polar defect in 
MMR, which was only observed using heteroduplexes designed to test repair 
from the 5* direction (no significant defect in repair from the 3* direction was 
observed in the same extracts). Interestingly, cells deficient in hMLHl also 
have a polar defect in MMR, but in this case preferentially affecting repair 
from the 3' direction (5). It is known from previous studies in both 
prokaryotes and eukaryotes that the separate enzymatic components mediate 
repair from the two different directions. Our results, in combination with 
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those of Dnimmond et al, strongly suggest a model in which 5* repair is 
primarily dependent on hPMS2 while 3' rep^- is primarily dependent on 
hMLHl . It is easy to envision how the dimeric complex between PMS2 and 
MLHl might set up this directionality. The combined results also 
demonstrate that a defect in directional MMR is sufficient to produce a 
RER+ phenotype. 

We anticipated that the dominant negative function of the PMS2-134 
polypeptide resulted from its binding to MLHl and consequent inWbition of 
MutLa function. This hypothesis was based in part on the fact that the most 
highly conserved domain of the PMS2 gene is located in its amino termmus, 
and the only known biochemical partner for PMS2 is MLHl. Our binding 
studies revealed, however, that the carboxyl terminus of PMS2, rather than 
the highly conserved amino terminus, actually mediated binding to MLHl. 
This result is consistent with those recently obtained in 5. cerevisciae, in 
which the MLHl -interacting domain of PMSl (the yeast homolog of human 
PMS2) was localized to its carboxyl-terminus (23). Our add-back 
experiments additionally showed that the hPMS2-134 mutant was not likely 
to mediate an interaaion with the MutSa complex (Fig. 6). The best 
explanation at present to explain the various observations made here is that 
the hPMS2-134 polypeptide does not inhibit the initial steps in MMR, but 
rather interacts with and inhibits a downstream component of the pathway, 
perhaps a nuclease required for repair from the 5' direction. 

The demonstration that the hPMS2-134 mutation can confer a 
dominant negative MMR defect to transfected cells helps to explain the 
phenotype of the kindred in which this mutant was discovered. Three 
individuals from this kindred were found to carry the mutation, a father and 
his two children. Both children exhibited microsattelite instability in their 
normal tissues and both developed tumors at an early age, while the father 
had no evidence of microsattelite instability in his normal cells and was 
completely healthy at age 35. The only difference known to us with respect 
to the MMR genes in this family is that the father*s mutant allele was 
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expressed at lower levels than the wild-type allele as assessed by sequencing 
of reverse transcriptase-PCR products of RNA from lymphocytes. The 
children expressed both alleles at approximately equal levels (Parsons et al 
and unpublished observations). We suspect that the dominant negative 
attribute of the hPMS2'134 mutant will only be manifest when it is present at 
sufficient concentrations (at least equunolar), thus explaining the absence of 
MMR deficiency in the father. The reason for the differential expression of 
the hPMS2-134 allele in this kindred is not clear, though imprinting is a 
possibility. Hopefiilly, the ascertainment of additional, larger kindreds with 
such mutations will facilitate the investigation of this issue. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: NICOLAIDES, NICHOLAS 
VOGELSTEIN, BERT 
KINZLER, KINZLER 

(ii) TITLE OF THE INVENTION: A METHOD FOR GENERATING 
HYPERMUTABLE ORGANISMS 

(iii) NUMBER OF SEQUENCES: 2 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Banner & Witcoff 

(B> STREET: 1001 G Street, NW 

(C) CITY: Washington 

(D) STATE: DC 

(E) COUNTRY: USA 

(F) ZIP: 20001 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Kagan, Sarah A 

(B) REGISTRATION NUMBER: 32141 

(C) REFERENCE /DOCKET NUMBER: 01107.73306 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-508-9100 

(B) TELEFAX: 202-508-9299 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2771 base pairs 

(B) TYPE: nucleic acid 
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10 



25 



45 



(C) STRANDEDNESS: double 

( D ) TOPOLOGY : 1 inear 

(11) MOLECXJLE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 25..,2610 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



CGAGGCGGAT CGGGTGTTGC ATCC ATG GAG CGA GCT GAG AGC TCG AGT ACA 51 
15 Met Glu Arg Ala Glu Ser Ser Ser Thr 

1 5 

GAA CCT GCT AAG GCC ATC AAA CCT ATT GAT CGG AAG TCA GTC CAT CAG 99 
Glu Pro Ala Lys Ala He Lys Pro He Asp Arg Lys Ser Val His Gin 
20 10 15 20 2 5 

ATT TGC TCT GGG CAG GTG GTA CTG AGT CTA AGC ACT GCG GTA AAG GAG 147 
He Cys Ser Gly Gin Val Val Leu Ser Leu Ser Thr Ala Val Lys Glu 
30 35 40 



TTA GTA GAA AAC AGT CTG GAT GCT GGT GCC ACT AAT ATT GAT CTA AAG 195 
Leu Val Glu Asn Ser Leu Asp Ala Gly Ala Thr Asn He Asp Leu Lys 
45 50 55 



30 CTT AAG GAC TAT GGA GTG GAT CTT ATT GAA GTT TCA GAC AAT GGA TGT 243 

Leu Lys Asp Tyr Gly Val Asp Leu He Glu Val Ser Asp Asn Gly Cys 
60 65 70 

GGG GTA GAA GAA GAA AAC TTC GAA GGC TTA ACT CTG AAA CAT CAC ACA 291 
35 Gly Val Glu Glu Glu Asn Phe Glu Gly Leu Thr Leu Lys His His Thr 

75 80 85 

TCT AAG ATT CAA GAG TTT GCC GAC CTA ACT CAG GTT GAA ACT TTT GGC 339 
Ser Lys He Gin Glu Phe Ala Asp Leu Thr Gin Val Glu Thr Phe Gly 
40 90 95 100 105 

TTT CGG GGG GAA GCT CTG AGC TCA CTT TGT GCA CTG AGC GAT GTC ACC 387 
Phe Arg Gly Glu Ala Leu Ser Ser Leu Cys Ala Leu Ser Asp Val Thr 
110 115 120 



ATT TCT ACC TGC CAC GCA TCG GCG AAG GTT GGA ACT CGA CTG ATG TTT 43 5 

He Ser Thr Cys His Ala Ser Ala Lys Val Gly Thr Arg Leu Met Phe 
125 130 135 



50 GAT CAC AAT GGG AAA ATT ATC CAG AAA ACC CCC TAC CCC CGC CCC AGA 483 

Asp His Asn Gly Lys He He Gin Lys Thr Pro Tyr Pro Arg Pro Arg 
140 145 150 

55 
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GGG ACC ACA GTC AGC GTG CAG CAG TTA TTT TCC ACA CTA CCT GTG CGC 531 
Gly Thr Thr Val Ser Val Gin Gin Leu Phe Ser Thr Leu Pro Val Arg 
155 160 165 

5 CAT AAG GAA TTT CAA AGG AAT ATT AAG AAG GAG TAT GCC AAA ATG GTC 579 

His Lys Glu Phe Gin Arg Asn lie Lys Lys Glu Tyr Ala Lys Met Val 
170 175 180 185 

CAG GTC TTA CAT GCA TAG TGT ATC ATT TCA GCA GGC ATG CGT GTA AGT 627 
10 Gin Val Leu His Ala Tyr Cys lie lie Ser Ala Gly lie Arg Val Ser 

190 195 200 

TGC ACC AAT CAG CTT GGA CAA GGA AAA CGA CAG CCT GTG GTA TGC ACA 675 
Cys Thr Asn Gin Leu Gly Gin Gly Lys Arg Gin Pro Val Val Cys Thr 
15 205 210 215 

GGT GGA AGC CCC AGC ATA AAG GAA AAT ATC GGC TCT GTG TTT GGG CAG 723 
Gly Gly Ser Pro Ser lie Lys Glu Asn lie Gly Ser Val Phe Gly Gin 
220 225 230 



20 



40 



AAG CAG TTG CAA AGC CTC ATT CCT TTT GTT CAG CTG CCC CCT AGT GAC 771 
Lys Gin Leu Gin Ser Leu lie Pro Phe Val Gin Leu Pro Pro Ser Asp 
235 240 245 



25 TCC GTG TGT GAA GAG TAC GGT TTG AGC TGT TCG GAT GCT CTG CAT AAT 819 

Ser Val Cys Glu Glu Tyr Gly Leu Ser Cys Ser Asp Ala Leu His Asn 
250 255 260 265 

CTT TTT TAC ATC TCA GGT TTC ATT TCA CAA TGC ACG CAT GGA GTT GGA 867 
30 Leu Phe Tyr lie Ser Gly Phe lie Ser Gin Cys Thr His Gly Val Gly 

270 275 280 

AGG AGT TCA ACA GAC AGA CAG TTT TTC TTT ATC AAC CGG CGG CCT TGT 915 
Arg Ser Ser Thr Asp Arg Gin Phe Phe Phe lie Asn Arg Arg Pro Cys 
35 285 290 295 

GAC CCA GCA AAG GTC TGC AGA CTC GTG AAT GAG GTC TAC CAC ATG TAT 963 
Asp Pro Ala Lys Val Cys Arg Leu Val Asn Glu Val Tyr His Met Tyr 
300 305 310 



AAT CGA CAC CAG TAT CCA TTT GTT GTT CTT AAC ATT TCT GTT GAT TCA 1011 
Asn Arg His Gin Tyr Pro Phe Val Val Leu Asn lie Ser Val Asp Ser 
315 320 325 



45 GAA TGC GTT GAT ATC AAT GTT ACT CCA GAT AAA AGG CAA ATT TTG CTA 1059 

Glu Cys Val Asp lie Asn Val Thr Pro Asp Lys Arg Gin He Leu Leu 
330 335 340 345 

CAA GAG GAA AAG CTT TTG TTG GCA GTT TTA AAG ACC TCT TTG ATA GGA 1107 
50 Gin Glu Glu Lys Leu Leu Leu Ala Val Leu Lys Thr Ser Leu He Gly 

350 355 360 

ATG TTT GAT AGT GAT GTC AAC AAG CTA AAT GTC AGT CAG CAG CCA CTG 115 5 

Met Phe Asp Ser Asp Val Asn Lys Leu Asn Val Ser Gin Gin Pro Leu 
55 365 370 375 



33- 



CTG GAT GTT GAA GGT AAC TTA ATA AAA ATG CAT GCA GCG GAT TTG GAA 1203 
Leu Asp Val Glu Gly Asn Leu lie Lys Met His Ala Ala Asp Leu Glu 
380 385 390 

5 AAG CCC ATG GTA GAA AAG CAG GAT CAA TCC CCT TCA TTA AGG ACT GGA 1251 

Lys Pro Met Val Glu Lys Gin Asp Gin Ser Pro Ser Leu Arg Thr Gly 
395 400 405 

GAA GAA AAA AAA GAC GTG TCC ATT TCC AGA CTG CGA GAG GCC TTT TCT 1299 
10 Glu Glu Lys Lys Asp Val Ser lie Ser Arg Leu Arg Glu Ala Phe Ser 

410 415 420 425 

CTT CGT CAC ACA ACA GAG AAC AAG CCT CAC AGC CCA AAG ACT CCA GAA 1347 
Leu Arg His Thr Thr Glu Asn Lys Pro His Ser Pro Lys Thr Pro Glu 
15 430 435 440 

CCA AGA AGG AGC CCT CTA GGA CAG AAA AGG GGT ATG CTG TCT TCT AGC 1395 
Pro Arg Arg Ser Pro Leu Gly Gin Lys Arg Gly Met Leu Ser Ser Ser 
445 450 455 



20 



40 



ACT TCA GGT GCC ATC TCT GAC AAA GGC GTC CTG AGA CCT CAG AAA GAG 1443 
Thr Ser Gly Ala lie Ser Asp Lys Gly Val Leu Arg Pro Gin Lys Glu 
460 465 470 



25 GCA GTG AGT TCC AGT CAC GGA CCC AGT GAC CCT ACG GAC AGA GCG GAG 1491 

Ala Val Ser Ser Ser His Gly Pro Ser Asp Pro Thr Asp Arg Ala Glu 
475 480 485 

GTG GAG AAG GAC TCG GGG CAC GGC AGC ACT TCC GTG GAT TCT GAG GGG 1539 
30 Val Glu Lys Asp Ser Gly His Gly Ser Thr Ser Val Asp Ser Glu Gly 

490 495 500 505 

TTC AGC ATC CCA GAC ACG GGC AGT CAC TGC AGC AGC GAG TAT GCG GCC 1587 
Phe Ser lie Pro Asp Thr Gly Ser His Cys Ser Ser Glu Tyr Ala Ala 
35 510 515 520 

AGC TCC CCA GGG GAC AGG GGC TCG CAG GAA CAT GTG GAC TCT CAG GAG 1635 
Ser Ser Pro Gly Asp Arg Gly Ser Gin Glu His Val Asp Ser Gin Glu 
525 530 535 



AAA GCG CCT GAA ACT GAC GAC TCT TTT TCA GAT GTG GAC TGC CAT TCA 1683 
Lys Ala Pro Glu Thr Asp Asp Ser Phe Ser Asp Val Asp Cys His Ser 
540 545 550 



45 AAC CAG GAA GAT ACC GGA TGT AAA TTT CGA GTT TTG CCT CAG CCA ACT 1731 

Asn Gin Glu Asp Thr Gly Cys Lys Phe Arg Val Leu Pro Gin Pro Thr 
555 560 565 

AAT CTC GCA ACC CCA AAC ACA AAG CGT TTT AAA AAA GAA GAA ATT CTT 17 79 

50 Asn Leu Ala Thr Pro Asn Thr Lys Arg Phe Lys Lys Glu Glu lie Leu 

570 575 580 585 

TCC AGT TCT GAC ATT TGT CAA AAG TTA GTA AAT ACT CAG GAC ATG TCA 182 7 

Ser Ser Ser Asp lie Cys Gin Lys Leu Val Asn Thr Gin Asp Met Ser 
55 590 595 600 



4 
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GCC TCT CAG GTT GAT GTA GCT GTG AAA ATT AAT AAG AAA GTT GTG CCC 1875 
Ala Ser Gin Val Asp Val Ala Val Lys lie Asn Lys Lys Val Val Pro 
605 610 615 

5 CTG GAG TTT TCT ATG AGT TCT TTA GCT AAA CGA ATA AAG CAG TTA CAT 1923 

Leu Asp Phe Ser Met Ser Ser Leu Ala Lys Arg lie Lys Gin Leu His 
620 625 630 

CAT GAA GCA CAG CAA AGT GAA GGG GAA CAG AAT TAC AGG AAG TTT AGG 1971 
10 His Glu Ala Gin Gin Ser Glu Gly Glu Gin Asn Tyr Arg Lys Phe Arg 

635 640 645 

GCA AAG ATT TGT CCT GGA GAA AAT CAA GCA GCC GAA GAT GAA CTA AGA 2019 
Ala Lys lie Cys Pro Gly Glu Asn Gin Ala Ala Glu Asp Glu Leu Arg 
15 650 655 660 665 

AAA GAG ATA AGT AAA ACG ATG TTT GCA GAA ATG GAA ATC ATT GGT CAG 2067 
Lys Glu lie Ser Lys Thr Met Phe Ala Glu Met Glu lie lie Gly Gin 
670 675 680 



20 



40 



TTT AAC CTG GGA TTT ATA ATA ACC AAA CTG AAT GAG GAT ATC TTC ATA 2115 
Phe Asn Leu Gly Phe lie lie Thr Lys Leu Asn Glu Asp lie Phe lie 
685 690 695 



25 GTG GAC CAG CAT GCC ACG GAC GAG AAG TAT AAC TTC GAG ATG CTG CAG 2163 

Val Asp Gin His Ala Thr Asp Glu Lys Tyr Asn Phe Glu Met Leu Gin 
700 705 710 

CAG CAC ACC GTG CTC CAG GGG CAG AGG CTC ATA GCA CCT CAG ACT CTC 2211 
30 Gin His Thr Val Leu Gin Gly Gin Arg Leu lie Ala Pro Gin Thr Leu 

715 720 725 

AAC TTA ACT GCT GTT AAT GAA GCT GTT CTG ATA GAA AAT CTG GAA ATA 2259 
Asn Leu Thr Ala Val Asn Glu Ala Val Leu lie Glu Asn Leu Glu lie 
35 730 735 740 745 

TTT AGA AAG AAT GGC TTT GAT TTT GTT ATC GAT GAA AAT GCT CCA GTC 2307 
Phe Arg Lys Asn Gly Phe Asp Phe Val lie Asp Glu Asn Ala Pro Val 
750 755 760 



ACT GAA AGG GCT AAA CTG ATT TCC TTG CCA ACT AGT AAA AAC TGG ACC 235 5 

Thr Glu Arg Ala Lys Leu lie Ser Leu Pro Thr Ser Lys Asn Trp Thr 
765 770 775 



45 TTC GGA CCC CAG GAC GTC GAT GAA CTG ATC TTC ATG CTG AGC GAC AGC 2403 

Phe Gly Pro Gin Asp Val Asp Glu Leu lie Phe Met Leu Ser Asp Ser 
780 785 790 

CCT GGG GTC ATG TGC CGG CCT TCC CGA GTC AAG CAG ATG TTT GCC TCC 2451 
50 Pro Gly Val Met Cys Arg Pro Ser Arg Val Lys Gin Met Phe Ala Ser 

795 800 805 

AGA GCC TGC CGG AAG TCG GTG ATG ATT GGG ACT GCT CTT AAC AC A AGC 2 499 

Arg Ala Cys Arg Lys Ser Val Met lie Gly Thr Ala Leu Asn Thr Ser 
55 810 815 820 825 



-35- 

GAG AT6 AAG AAA CTG ATC ACC CAC ATG GGG GAG ATG 6AC CAC CCC TGG 2547 
Glu Met Lys Lye Leu lie Thr His Met Gly Glu Met Asp His Pro Trp 
830 835 840 

5 AAC TGT CCC CAT GGA AGG CCA ACC ATG AGA CAC ATC GCC AAC CTG GGT 2595 

Asn Cys Pro His Gly Arg Pro Thr Met Arg His lie Ala Asn Leu Gly 
845 850 855 

GTC ATT TCT CAG AAC TGACCGTAGT CACTGTATGG AATAATTGGT TTTATCGCAG A 2651 
10 Val lie Ser Gin Asn 

860 



15 



TTTTTATGTT TTGAAAGACA GAGTCTTCAC TAACCTTTTT TGTTTTAAAA TGAAACCTGC 2711 
TACTTAAAAA AAATACACAT CACACCCATT TAAAAGTGAT CTTGAGAACC TTTTCAAACC 2771 



(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS; 
20 (A) LENGTH: 862 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: protein 

(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

30 Met Glu Arg Ala Glu Ser Ser Ser Thr Glu Pro Ala Lys Ala lie Lys 

15 10 15 

Pro lie Asp Arg Lys Ser Val His Gin lie Cys Ser Gly Gin Val Val 

20 25 30 

Leu Ser Leu Ser Thr Ala Val Lys Glu Leu Val Glu Asn Ser Leu Asp 
35 35 40 45 

Ala Gly Ala Thr Asn lie Asp Leu Lys Leu Lys Asp Tyr Gly Val Asp 

50 55 60 

Leu lie Glu Val Ser Asp Asn Gly Cys Gly Val Glu Glu Glu Asn Phe 
65 70 75 80 

40 Glu Gly Leu Thr Leu Lys His His Thr Ser Lys lie Gin Glu Phe Ala 

85 90 95 

Asp Leu Thr Gin Val Glu Thr Phe Gly Phe Arg Gly Glu Ala Leu Ser 

100 105 110 

Ser Leu Cys Ala Leu Ser Asp Val Thr lie Ser Thr Cys His Ala Ser 
45 115 120 125 

Ala Lys Val Gly Thr Arg Leu Met Phe Asp His Asn Gly Lys lie lie 

130 135 140 

Gin Lys Thr Pro Tyr Pro Arg Pro Arg Gly Thr Thr Val Ser Val Gin 
145 150 155 160 

50 Gin Leu Phe Ser Thr Leu Pro Val Arg His Lys Glu Phe Gin Arg Asn 

165 170 175 

lie Lys Lys Glu Tyr Ala Lys Met Val Gin Val Leu His Ala Tyr Cys 

180 185 190 

lie lie Ser Ala Gly lie Arg Val Ser Cys Thr Asn Gin Leu Gly Gin 



-36- 



195 200 205 

Gly Lya Arg Gin Pro Val Val Cys Thr Gly Gly Ser Pro Ser lie Lys 

210 215 220 

Glu Asn lie Gly Ser Val Phe Gly Gin Lys Gin Leu Gin Ser Leu lie 
225 230 235 240 

Pro Phe Val Gin Leu Pro Pro Ser Asp Ser Val Cys Glu Glu Tyr Gly 

245 250 255 

Leu Ser Cys Ser Asp Ala Leu His Asn Leu Phe Tyr lie Ser Gly Phe 

260 265 270 

lie Ser Gin Cys Thr His Gly Val Gly Arg Ser Ser Thr Asp Arg Gin 

275 280 285 

Phe Phe Phe lie Asn Arg Arg Pro Cys Asp Pro Ala Lys Val Cys Arg 

290 295 300 

Leu Val Asn Glu Val Tyr His Met Tyr Asn Arg His Gin Tyr Pro Phe 
305 310 315 320 

Val Val Leu Asn lie Ser Val Asp Ser Glu Cys Val Asp lie Asn Val 

325 330 335 

Thr Pro Asp Lys Arg Gin lie Leu Leu Gin Glu Glu Lys Leu Leu Leu 

340 345 350 

Ala Val Leu Lys Thr Ser Leu lie Gly Met Phe Asp Ser Asp Val Asn 

355 360 365 

Lys Leu Asn Val Ser Gin Gin Pro Leu Leu Asp Val Glu Gly Asn Leu 

370 375 380 

lie Lys Met His Ala Ala Asp Leu Glu Lys Pro Met Val Glu Lys Gin 
385 390 395 400 

Asp Gin Ser Pro Ser Leu Arg Thr Gly Glu Glu Lys Lys Asp Val Ser 

405 410 415 

lie Ser Arg Leu Arg Glu Ala Phe Ser Leu Arg His Thr Thr Glu Asn 

420 425 430 

Lys Pro His Ser Pro Lys Thr Pro Glu Pro Arg Arg Ser Pro Leu Gly 

435 440 445 

Gin Lys Arg Gly Met Leu Ser Ser Ser Thr Ser Gly Ala lie Ser Asp 

450 455 460 

Lys Gly Val Leu Arg Pro Gin Lys Glu Ala Val Ser Ser Ser His Gly 
465 470 475 480 

Pro Ser Asp Pro Thr Asp Arg Ala Glu Val Glu Lys Asp Ser Gly His 

485 490 495 

Gly Ser Thr Ser Val Asp Ser Glu Gly Phe Ser lie Pro Asp Thr Gly 

500 505 510 

Ser His Cys Ser Ser Glu Tyr Ala Ala Ser Ser Pro Gly Asp Arg Gly 

515 520 525 

Ser Gin Glu His Val Asp Ser Gin Glu Lys Ala Pro Glu Thr Asp Asp 

530 535 540 

Ser Phe Ser Asp Val Asp Cys His Ser Asn Gin Glu Asp Thr Gly Cys 
545 550 555 560 

Lys Phe Arg Val Leu Pro Gin Pro Thr Asn Leu Ala Thr Pro Asn Thr 

565 570 575 

Lys Arg Phe Lys Lys Glu Glu lie Leu Ser Ser Ser Asp lie Cys Gin 

580 585 590 

Lys Leu Val Asn Thr Gin Asp Met Ser Ala Ser Gin Val Asp Val Ala 

595 600 605 

Val Lys lie Asn Lys Lys Val Val Pro Leu Asp Phe Ser Met Ser Ser 

610 615 620 

Leu Ala Lys Arg lie Lys Gin Leu His His Glu Ala Gin Gin Ser Glu 
625 630 635 640 
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Gly Glu Gin Aan Tyr Arg Lys Phe Arg Ala I*ys lie Cys Pro Gly Glu 

645 650 655 

Aon Gin Ala Ala Glu Asp Glu Leu Arg Lys Glu lie Ser Lys Thr Met 

660 665 670 

Phe Ala Glu Met Glu lie lie Gly Gin Phe Asn Leu Gly Phe lie lie 

675 680 685 

Thr Lys Leu Asn Glu Asp He Phe lie Val Asp Gin His Ala Thr Asp 

690 695 700 

Glu Lys Tyr Asn Phe Glu Met Leu Gin Gin His Thr Val Leu Gin Gly 
705 710 715 720 

Gin Arg Leu He Ala Pro Gin Thr Leu Asn Leu Thr Ala Val Asn Glu 

725 730 735 

Ala Val Leu He Glu Asn Leu Glu He Phe Arg Lys Asn Gly Phe Asp 

740 745 750 

Phe Val He Asp Glu Asn Ala Pro Val Thr Glu Arg Ala Lys Leu He 

755 760 765 

Ser Leu Pro Thr Ser Lys Asn Trp Thr Phe Gly Pro Gin Asp Val Asp 

770 775 780 

Glu Leu He Phe Met Leu Ser Asp Ser Pro Gly Val Met Cys Arg Pro 
785 790 795 800 

Ser Arg Val Lys Gin Met Phe Ala Ser Arg Ala Cys Arg Lys Ser Val 

805 810 815 

Met He Gly Thr Ala Leu Asn Thr Ser Glu Met Lys Lys Leu He Thr 

820 825 830 

His Met Gly Glu Met Asp His Pro Trp Asn Cys Pro His Gly Arg Pro 

835 840 845 

Thr Met Arg His He Ala Asn Leu Gly Val He Ser Gin Asn 
850 855 860 



